arXiv: 1508.04216vl [math.ST] 18 Aug 2015 


The Annals of Statistics 
2015, Vol. 43, No. 4, 1831-1864 
DOI: 10.1214/15-AOS1327 

(c) Institute of Mathematical Statistics, 2015 


JUMP ACTIVITY ESTIMATION FOR PURE-JUMP 
SEMIMARTINGALES VIA SELF-NORMALIZED STATISTICS^ 

By Viktor Todorov 

Northwestern University 

We derive a nonparametric estimator of the jump-activity index 
j3 oi a. “locally-stable” pure-jump Ito semimartingale from discrete 
observations of the process on a fixed time interval with mesh of the 
observation grid shrinking to zero. The estimator is based on the em¬ 
pirical characteristic function of the increments of the process scaled 
by local power variations formed from blocks of increments spanning 
shrinking time intervals preceding the increments to be scaled. The 
scaling serves two purposes: (1) it controls for the time variation in the 
jump compensator around zero, and (2) it ensures self-normalization, 
that is, that the limit of the characteristic function-based estimator 
converges to a nondegenerate limit which depends only on j3. The 
proposed estimator leads to nontrivial efficiency gains over existing 
estimators based on power variations. In the Levy case, the asymp¬ 
totic variance decreases multiple times for higher values of /?. The 
limiting asymptotic variance of the proposed estimator, unlike that 
of the existing power variation based estimators, is constant. This 
leads to further efficiency gains in the case when the characteristics 
of the semimartingale are stochastic. Finally, in the limiting case of 
/3 = 2, which corresponds to jump-diffusion, our estimator of (5 can 
achieve a faster rate than existing estimators. 

1. Introduction. In this paper we are interested in estimating the jump 
activity index of a process defined on a filtered probability space 
(Jj)t>o,P) and given by 

( 1 . 1 ) dXt = atdt + at-dLt + dYt, 

when L is locally stable pure-jump Levy process (i.e., a pure-jump Levy pro¬ 
cess whose Levy measure around zero behaves like that of a stable process) 
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and y is a pure-jump process which is “dominated” at high-frequencies by 
L in a sense which is made precise below; see Assumption A. All formal 
conditions for X are given in Section 2. The jump activity index of A on a 
given fixed time interval is the infimum of the set of powers p for which the 
sum of pth absolute moments of the jumps is finite. Provided a does not 
vanish on the interval and has cadlag paths, the jump activity index of X 
coincides with the Blumenthal-Getoor index of the driving Levy process L 
(recall V is dominated by L at high frequencies). The dominant role of L 
at high frequencies, together with its stable-like Levy measure around zero, 
manifests into the following limiting behavior at high frequencies: 


(1.2) h-^/^(Xt+sh-Xt)^atX (St+s-St) as/i^0andse[0,l] 


for every t and where S is ;0-stable process, with the convergence being for 
the Skorokhod topology. Equation (1.2) holds when /3 > 1 which is the case 
we consider in this paper. (When /3 < 1 the drift will be the “dominant” 
component at high-frequencies, and some of our results can be extended 
to this case as well.) We study estimation of from discrete equidistant 
observations of X on a fixed time interval with mesh of the observation grid 
shrinking to zero. 

Estimation of the jump activity index has received a lot of attention re¬ 
cently. [20] consider estimation from low-frequency observations in the set¬ 
ting of Levy processes. [4] and [6] consider estimation from low-frequency 
data in the setting of time-changed Levy processes with an independent 
time-change process. [2] consider estimation from low-frequency and op¬ 
tions data. [3] and [5] consider estimation from low frequency data in certain 
stochastic volatility models. [27-29] propose estimation from high-frequency 
data using power variations in a pure-jump setting. [1] and [16] consider es¬ 
timation in high-frequency setting when the underlying process can contain 
a continuous martingale via truncated power variations. [23] propose esti¬ 
mation of the jump activity index in pure-jump setting via power variations 
with adaptively chosen optimal power. [22] extend [23] via power variations 
of differenced increments which provide further robustness and efficiency 
gains. [15] consider jump activity estimation from noisy high-frequency data. 

The estimation of (3 from high-frequency data, thus far, makes use of the 
dependence of the scaling factor of the high-frequency increments in (1.2) 
on /?. For example, consider the power variation 


n 




2 = 1 


(1.3) 


1 


p > 0. 


n 
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Under certain technical conditions, (1.2) implies 

f^V{p, An) ^ n [ \crs\^ds, 

Jo 

{2An)^-P/^V{p,2An)^fi ['wsl^ds, 

Jo 

where p is some constant. An estimate of /3 then can be simply formed as a 
nonlinear function of the ratio This makes inference for /3 possible 

despite the unknown process a. 

The limit result in (1.2), however, contains much more information about 
/3 than previously used in estimation. In particular, (1.2) implies that over a 
short interval of time the increments of X, conditional on a at the beginning 
of the interval, are approximately i.i.d. stable random variables. In this paper 
we propose a new estimator of /3 that utilizes this additional information in 
(1.2) and leads to significant efficiency gains over existent estimators based 
on high-frequency data. 

The key obstacle in utilizing the result in (1.2) in inference for /3 is the fact 
that the process a is unknown and time-varying. The idea of our method 
is to form a local estimator of a using a block of high-frequency increments 
with asymptotically shrinking time span via a localized version of (1.3). We 
then divide the high-frequency increments of X by the local estimator of a. 
The division achieves “self-normalization” in the following sense. First, the 
scale factor for the local estimator of a and the high-frequency increment of 
X are the same, and hence by taking the ratio, they cancel. Second, both 
the high-frequency increment of X and the local estimator of a are approxi¬ 
mately proportional to the value of a at the beginning of the high-frequency 
interval, and hence taking their ratio cancels the effect of the unknown a. 
The resulting scaled high-frequency increments are approximately i.i.d. sta¬ 
ble random variables, and we make inference for (3 via an analogue of the 
empirical characteristic function approach, which has been used in various 
other contexts; see, for example, [8] . 

After removing an asymptotic bias, the limit behavior of the empirical 
characteristic function of the scaled high-frequency increments is determined 
by two correlated normal random variables. One of them is due to the lim¬ 
iting behavior of the empirical characteristic function of the high-frequency 
increments scaled by the limit of the local power variation. The other is due 
to the error in estimating the local scale by the local realized power vari¬ 
ation. Importantly, because of the “self-normalization,” the J^-conditional 
asymptotic variance of the empirical characteristic function of the scaled 
high-frequency increments is not random but rather a constant that depends 
only on /? and the power p. This makes feasible inference very easy. 
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When comparing the new estimator with existing ones based on the power 
variation, we find nontrivial efficiency gains. There are two reasons for the 
efficiency gains. First, as we noted above, our estimator makes full use of 


the limiting result in (1.2) and not just the dependence of the scale of the 


high-frequency increments on (3, which is the case for existing ones. Second, 
by locally removing the effect of the time-varying a, we make the inference 
as if (T is constant; that is, the limit variance is the same, regardless of 
whether X is Levy or not. By contrast, the estimator based on the ratios 
of power variations is asymptotically mixed normal with J^-conditional vari- 



where constant on the interval [0,1]. That is, the presence of time-varying 
a decreases the precision of the power-variation based estimator of (3. 

The efficiency gains of our estimator are bigger for higher values of j3. In 
the limit case of (3 = 2, which corresponds to L being a Brownian motion, 
we show that our estimator can achieve a faster rate of convergence than 
the standard y/n rate for existing estimators. 

The rest of the paper is organized as follows. In Section 2 we introduce the 
setting. In Section 3 we construct our statistic, and in Section 4 we derive 
its limit behavior. In Section 5 we build on the developed limit theory and 
construct new estimators of the jump activity and derive their limit behavior. 
This section also shows the efficiency gains of the proposed jump activity 
estimators over existing ones. Section 6 deals with the limiting case of jump- 
diffusion. Sections 7 and 8 contain a Monte Carlo study and an empirical 
application, respectively. Proofs are in Section 9. 

2. Setting and assnmptions. We start with introducing the setting and 
stating the assumptions that we need for the results in the paper. We first 
recall that a Levy process L with the characteristic triplet (6, c, zz), with 
respect to truncation function k (Definition 11.2.3 in [14]), is a process with 
a characteristic function given by 


( 2 . 1 ) 



t > 0. 


In what follows we will always assume for simplicity that k{—x) = —k{x). 
Our assumption for the driving Levy process in (I.I) as well as the “residual” 
jump component Y is given in Assumption A. 
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Assumption A. L in (1.1) is a Levy process with characteristic triplet 
(0,0, u) for V a Levy measure with density given by 

(2.2) u(x) = 1^1^-Lu'(x), /3 g(0,2), 

where A > 0 and u'(x) is such that there exists xq > 0 with |u'(x)| < 
for |x| < xq and some /3' < (5. 

Y is an Ito semimartingale with the characteristic triplet ([14], Defini¬ 
tion 11.2.6) (/q Jjg k{x)uJ (dx) ds, 0, dt (8) I'Y(dx)) when (3' <1 and (0,0, dt (8) 
(dx)) otherwise, with Jj^(|x|^ A l)vY (dx) being locally bounded and 
predictable, for some arbitrarily small l> 0. 


Assumption A formalizes the sense in which Y is dominated at high fre¬ 
quencies by L: the activity index of Y is below that of L. We also stress that 
Y and L can have dependence. Therefore, as shown in [24], we can accom¬ 
modate in our setup time-changed Levy models, with absolute continuous 
time-change process, that have been extensively used in applied work. Fi¬ 
nally, we note that (2.2) restricts only the behavior of u around zero, and 
ly' is a signed measure. Therefore many parametric jump specifications out¬ 
side of the stable process are satisfied by Assumption A (e.g., the tempered 
stable process). We next state our assumption for the dynamics of a and a. 


Assumption B. The processes a and a are Ito semimartingales of the 
form 

at = ao+ b'^ds+ / k{ 6°‘{s, x))Jl{ds, dx) 

Jo Jo Je ~ 

+ [ k'{ 5°"{s, x))fi{ds, dx), 

Je ~ 

o't = o'o+ bgds+ / {s,x))Jl{ds,dx) 

Jo Jo Je ~ 

+ / k'{ 6°^{s, x))fi{ds, dx), 

Je ~ 


(2.3) 


where k'(x) = x — k{x), and: 

(a) and are strictly positive; 

(b) is Poisson measure on M+ x E, having arbitrary dependence with 
the jump measure of L, with compensator dt (8) X{dx) for some u-finite mea¬ 
sures A on 

(c) 6^{t,x) and d‘^{t,x) are predictable, left-continuous with right limits 
in t with \6°{t,x) \ + \ 6^{t,x)\ < 7 fc(x) for all t <Tk, where 7 fc(x) is a deter¬ 
ministic function on M with /ij(| 7 fc(x)|’'^''A l)A(dx) < oo for arbitrarily small 
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t > 0 and some 0 <r < (3, and Tk is a sequence of stopping times increasing 
to +oo; 

(d) 6" and 6°^ are ltd semimartingales having dynamics as in (2.3) with 
coefficients satisfying the analogues of conditions (b) and (c) above. 

We note that // does not need to coincide with the jump measure of L, and 
hence it allows for dependence between the processes a, a and L. This is of 
particular relevance for financial applications. For example, Assumption B 
is satisfied by the COGARCH model of [17] in which the jumps in a are 
proportional to the squared jumps in X. More generally, Assumption B is 
satisfied if, for example, {X,a,a) is modeled via a Levy-driven SDE, with 
each of the elements of the driving Levy process satisfying Assumption A. 


3. Construction of the self-normalized statistics. We continue next with 
the construction of our statistics. The estimation in the paper is based on 
observations of X at the equidistant grid times 0, ^,..., 1 with n —>■ oo, and 
we denote A„ = -. To minimize the effect of the drift in our statistics, we 
follow [22] and work with the first difference of the increments, X^X — 
where A^^X = X^/^ — for i = 1,..., re. The above difference 

of increments is purged from the drift in the Levy case, and in the general 
case the drift has a smaller asymptotic effect on it. For each A”A — A'^_^X, 
we need a local power variation estimate for the scale. It is constructed from 
a block of kn high-frequency increments, for some 1 < < re — 2, as follows: 

(3.1) vr{p) = ^ Yl \^1X-A^_,X\^, i = kn + 3,...,n. 

j=i-kr,-l 

Block-based local estimators of volatility have been also used in other con¬ 
texts in a high-frequency setting, for example, in [13] and [25]. The empirical 
characteristic function of the scaled differenced increments is given by 


(3.2) C^ip,u) = 


n — kr, —2 


E 

2 — /Cti+S 


COS u 


A^X-AUX 

(R-(p))l/p 


u G. 


We proceed with some notation needed for the limiting theory of C'^{p,u). 
Let Si , S 2 and S 3 be random variables corresponding to the values of three 
independent Levy processes at time 1, each of which with the characteristic 
triplet (0,0, re), for any truncation function k and where re has the density 
Then we denote = (Ej5i — , which does not depend on 

re, and we further use the shorthand notation for any 

re > 0 with Aj 3 being a (known) function of A and (5. Using Example 25.10 
in [21] and references therein, we have 


(3.3) 


Cp^0 — 


Al 

k'pA 


2PT{{l+p)/2)T{l-p/l3) 

VvrF(l-p/2) 


-P/p 
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which depends only on p and /3 but not on the scale parameter of the stable 
random variables and 82 - With this notation, we set 

(3.4) C{p,u,l3) = , uGM+, 

which will be the limit in probability of C'^{p,u). We finish with some more 
notation needed to describe the asymptotic variance of £^{p,u). First, we 
denote for some u G M+, 


(3.5) 


^i(p,u,/3) = (^cosT ^*''^^^, -C{p,u,l3), 


\S1-S2\p y 
) ’ 


^ 2 {p,u,/ 3 ) = cos 






^P,/5 


We then set for tt, u G M+ 

(3.6) Ei{p,u,v,/3) =E{Ci{p,u,l3)^[^i{p,v,/3)), z = 0,l 


and 


(3.7) 


P 

H{p, u, (3) = G{p, u, (3) (-Cp^i3U^ - - - iV 
\P P J 


4. Limit theory for C^{p, u). We start with convergence in probability. 


Theorem 1. Assume X satisfies Assumptions A and B for some f3 G 
(1,2) and f3' < (3. Let kn he a deterministic sequence satisfying kn x for 
some w G (0,1) . Then, for 0 <p < (3, we have 

(4.1) C'^{p,u)C{p,u,/3) asn—)-oo, 

locally uniformly in u£ M+. 


We note that we restrict /3 > 1; that is, we focus on the infinite varia¬ 
tion case. The above theorem will continue to hold for /3 < 1, but for the 
subsequent results about the limiting distribution of C^{p,u), we will need 
quite stringent additional restrictions in the case ,0 < 1. We do not pursue 
this here. The other conditions for the convergence in probability result are 
weak. The requirements for a and a for Theorem 1 to hold are actually 
much weaker than what is assumed in Assumption B, but for simplicity of 
exposition we keep Assumption B throughout. We note that for consistency, 
we have a lot of flexibility about the block size kn'- (1) —>■ oo so that we 
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consistently estimate the scale via V^{p) and (2) fc„/n —>• 0 so that the span 
of the block is asymptotically shrinking to zero, and therefore no bias is gen¬ 
erated due to the time variation of cr. In the case when X is a Levy process, 
the second condition is obviously not needed. 

To derive a central limit theorem (c.l.t.) for C'^{p,u), we will need to 
restrict the choice of kn more. We will assume kn/y/n^ 0, so that biases 
due to the time variation in cr, which are hard to feasibly estimate, are 
negligible. For such a choice of kn, however, an asymptotic bias due to the 
sampling error of V^{p) appears, and for stating a c.l.t., we need to consider 
the following bias-corrected estimator: 


C^{p,u,l3y 

(4.2) 


C^ip,u) 

- ■^\h{p,u, P){E^q’‘^\p,u,u, P) + 2Ef’^\p,u,u, P)). 

Kn ^ 


We state the c.l.t. for C'^{p,u,f3)' in the next theorem. 


Theorem 2. Assume X satisfies Assumptions A and B with (3 G (1,2) 
and fi' < ^, and that the power p and block size kn satisfy 


(4.3) 

(4.4) 




2 (/ 3 -/ 3 '; 


fi -1 

V —— <p< 




2 ’ 


Then, as n —>■ oo, we have 

(4.5) y/n(C^{p, u, fiY - C{p, u, fi)) -4 Zi{u) + G{p, u, fi)Z 2 {u), 


locally uniformly in ttGM+. Zi{u) and Z 2 {u) are two Gaussian processes 
with the following covariance structure: 


(4.6) E{Z{u)Z{v)) =Eo{p,u,v,/3)+ 2Ei{p,u,v,fi), u,v£R+, 

where Z{u) = {Zi{u),Z 2 {u))'. 

Let fi he an estimator of fi with j3 — j3 = Op{kny/Xn) as n —>■ oo. Then 


(4.7) u, h' - u, fi)') A 0, 


locally uniformly in M_|_. 


The conditions for the power p in (4.3) are exactly the same as in [22] for 
the analysis of the realized power variation, and they are relatively weak. 
For example, the condition p > will be always satisfied as soon as we 
pick power slightly above Moreover, this condition is not needed in the 
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case when X is a Levy process. Further, the condition in (4.4) for kn shows 
that we have more flexibility for the choice of kn whenever p is not very 
close to its upper bound of /3/2. 

Due to the self-normalization in the construction of our statistic, the lim¬ 
iting distribution in (4.5) is Gaussian and not mixed Gaussian, which is the 
case for most limit results in high-frequency asymptotics (and in particular 
for the power variation based estimator of /3); see [26] for another excep¬ 
tion. This is very convenient as the estimation of the asymptotic variance 
is straightforward. The bias correction in (4.2) is infeasible, as it depends 
on /3. However, (4.7) shows that a feasible version of the debiasing would 
work provided the initial estimator of /3 is Op{knV‘^n)- When one estimates 
/3 using £"'(p, u), with explicit estimators provided in the next section, (3 — (3 
will be Op{l/kn)- Hence, such a preliminary estimate of f3 will satisfy the 
required rate condition in Theorem 2. 


5. Jump activity estimation. We now use the limit theory developed 
above to form estimators of p. The simplest one is based on C'^{p,u) and is 
given by 


(5.1) 


'^^^{p,u,v) 


log{-log{C^{p,u)))-log{-log{C^{p,v))) 
log{u/v) 


for tt,u e R+ with u^v. Because of the asymptotic bias in £^{p,u), (3^^{p, 
u,v) — P will be only Op{l/kn), with p and kn satisfying (4.3)-(4.4). An 
explicit estimate of f3 using feasible debiasing is given by 


(5.2) 


Mp,u,v) 


iog{-iog{c^{p,u,p^y)) -iog{-\og{£^{p,v,p^y)) 

log{u/v) 


for some € M+ with and where (3^^ is a suitable initial estimator 

of (3 [like the one in (5.1)]. While convenient, the above estimators have 
two potential drawbacks. One, we do not take into account the information 
about f3 in the constant Cp^p. This is because in the asymptotic limit of the 
above estimators, Cp^p gets canceled. Second, u and v are chosen arbitrarily, 
and one can include more moment conditions for the estimation of [3 using 
£^{p, u, (3^^)'. In the next theorem we provide a general estimator of f3 which 
overcomes these drawbacks of the explicit estimators above. 


Theorem 3. Assume X satisfies Assumptions A and B with (3 £ (1,2) 
and (3' < (3/2, and that the conditions in (4-3) and (4-4) hold. Suppose (3^^ 
is a consistent estimator of f3 with (3^^ — (3 = Op{kn'/Xy). Denote with uj 
and u/j two sequences of K x l-dimensional vectors, for some finite K >1, 
satisfying u/ —)• u; and —)• u^, as oo, for some u;,u/j G with 
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ul < u\, uj < uj^ and {ul,u]^) H {uj,uj^) = 0 for every i,j = l,...,K with 
i j where u\ and u\ denote the ith element of the vectors u/ and u^, 
respeetively. Set further the shorthand u= [u/;u/i] and u= [ui;ufe]. 

Let W(p, u,/3) be K X K matrix with {i,j) element given by 

fK 

(5.3) W(p, u,/3)jj = / / w{p,u,v, ft) dudv, 

J uj 

“■ ”■ = Cip.nJmp.v.P) ( G(p)u, 0 )) 

where E{p,u,v,(3) = Eo{p,u,v,/3) + 2Ei{p,u,v, I3). 

Define the K x 1 vector m(p, u,/3-^®, u,/3) by 

^ r^h ^ ^ 

(5.4) ni{p,u,fiffiu,/3).= (}og{C^{p,u,/3f^y)-log{C{p,u,fi)))du, 

Jui 

for i = 1,..., K, and set 
fi{p, u) 

(5-5) ^ ^ ^ 

= argminm(p, u,u,/3)^W ^(p, u,/3-^®)m(p,u,u, ft). 
/ 36 ( 1 , 2 ) 

Finally define the K x 1 vector M(p, u,/3) by 

pufi 

(5.6) M{p,u,/3)i= V0log{C{p,u,l3))du, i = l,...,K. 

Ju\ 

Then for ft G (1, 2), p G (^(^frgTy, f) and fd' < /3/2, we have 

(5.7) Vn0{p, u) - /3) \/M(p, u, /3)'W"i (p, u, /3)M(p, u, fi) x J\f, 

for n ^ oo with J\f being standard normal random variable. 

A eonsistent estimator for the asymptotic variance of I3{p,u) is given by 

(5.8) M(p, u, ^)'(p, u, ^)M(p, u, fi), 

where M(p, u,/3) is defined as M(p, u,/3) with u and fi replaced by u and ft. 

Theorem 3 allows us to adaptively choose the range of u over which to 
match C'^{p,u, ft^^y with its limit. This is convenient because the limiting 
variance of i3"'(p, u, ft-^^)' depends on ft. For this reason also the weight func¬ 
tion in (5.3) optimally weighs the moment conditions in the estimation. We 
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discuss the practical issues regarding the construction of m(p, u,u,/?) 
in Section 7. 

We now illustrate the efficiency gains provided by the new method over 
existing power variation based estimators of /3. The power variation estima¬ 
tor based on the differenced increments is given by (see [22]) 

plog(2) 

log[VT(p) (p)] ’ 


Y,\^2x-a^_,x\p, 

i=2 
n 

- A^X + Ar_2W - 

i=A 

On Figure 1, we plot the limiting standard deviation of the estimators 
in (5.5) and (5.9) for different values of /3. [The estimator in (5.9) is de¬ 
rived under exactly the same assumptions for X as our estimator here.] 
The asymptotic standard deviation of f5{p) is computed from [22]. I3{p,u) 
is far less sensitive to the choice of p than /3(p), with lower powers yield¬ 
ing marginally more efficient I3{p,u). The news estimator I3{p,u) provides 
nontrivial efficiency gains irrespective of the values of p and j3. The gains 
are bigger for high values of the jump activity. For example, for /3 = 1.75, 
(3{p,u) is around two times more efficient (in terms of asymptotic standard 
deviation) than /3{p). 


(5.9) /3(p) 
where 

Kip) = 

(5.10) 

Kip) = 


6. The limiting case of jump-diffusion. So far our analysis has been for 
the pure-jump case of /3 G (1,2). We now look at the limiting case of /3 = 2, 
which corresponds to L in (1.1) being a Brownian motion. In this case the 
asymptotic behavior of the high-frequency increments in (1.2) holds with 
S being a Brownian motion. Thus deciding /3 = 2 versus /3 < 2 amounts to 
testing pure-jump versus jump-diffusion specification for X. It turns out that 
when /3 = 2, our estimation method can lead to a faster rate of convergence 
than the y/n rate we have seen for the case /3 G (1,2). This is unlike the 
power-variation based estimation methods for which the rate of convergence 
is y/n, both for /3 = 2 and /3 < 2; see, for example, [23]. 

The faster rate of convergence in the case (3 = 2 can be achieved by letting 
the argument u of the empirical characteristic function C{p,u) drift toward 


zero as n 


oo. In this case, ) ^^d ^ , for 


Cp2U: 


Cp,2P^ul 


some 


p > 0, are asymptotically perfectly correlated, and their difference converges 
at a faster rate. We note that this does not work in the pure-jump case of 
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Fig. 1. Asymptotic standard deviation of jump activity estimators. The straight line cor¬ 
responds to the asymptotic standard deviation of the characteristic function based estimator 
defined in (5.5) and the * line to the power variation based estimator of [22] given in (5.9) 
(when a is constant). For each cases of /3, the power p ranges in the interval p € (^, ||)/3. 
For the estimator in (5.5), the vector u; = [0.1 : 0.05 : 5] and = [0.15 : 0.05 : 5.05[. 


/3 < 2. To state the formal result we first introduce some notation. For 
5*2 and S 3 being independent standard normal random variables, we denote 


( 6 . 1 ) 


6(p) = 
6(p) = 


\Si-S2\^ 12 \S1-S2\P 




9 1 


F'p,2 


| 52 - 53|4 12 1^2 - 53 ^ 


//2 

F'p,2 


9 1 

f^p,2 


F'p ,2 


- 1 


- 1 


and then set = E(^i(p)^'j^_|_^(p)) for i = 0,1. The difference from the 

analogous expression for the case /3 < 2 is in the first terms of ^i{p) and 
^ 2 (p)- Note that the expression for the bias-correction remains exactly the 
same as it involves only the variance and covariance of the second elements 
of ^i(p) and ^ 2 ip), which remain the same as their pure-jump counterparts. 
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Theorem 4. Suppose X has dynamics given by (1.1) with L being a 
Brownian motion, Y satisfying the corresponding condition for it in As¬ 
sumption A and a and a satisfying Assumption B for some r <2. Suppose 
p < I, kny/Xn —)• 0 and Un —)• 0, and further 


^^//3'-p/2)A(p+l)/(rVl+l)-t ^ ^-(l/pA3/2)+t ^ 


( 6 . 2 ) 




(fe„A„)lM(2-p)/2-l/2 


0 . 




Then for some p> 0 

(6.3) P'^{p,un,pun)-2 = Op{k~^ul). 


Further, if for some initial estimator 


- 2 = Op{knuly/\(), then 


(6.4) 


n 


<(l-p2) 


(l3{jp,Un,pUn) - 2 ) 


'] —TT f Yat' —- Cp2Z2 

^og{p)\2ACpp p 


where Z\ and Z 2 are two zero-mean normal random variables with covari¬ 
ance given by Ho(p) + 2Ei{p). 

When X is a Levy process, the requirement for k^ and Un reduces to 


(6.5) 


Un ^ 0? 


^{p/p'M-p/2)-i ^ ^-(l/pA3/2)+. 


The rate of convergence of the estimator for /3 is now ^/nu~‘^ and is 
faster than the one in Theortem 3, when Un converges to zero. The latter 
is determined by the restriction in (6.2), which in turn is governed by the 
presence of the “residual” term Y, the variation in a and the sampling 
variation in measuring the scale via [p). For the condition to be satished 
we need p£ (1/2,1) and /?' < 1; that is, the jumps in X are of hnite variation; 
for testing the null hypothesis of presence of diffusion when the process can 
contain infinite variation jumps under the null, see the recent work of [18]. 
Without any prior knowledge on /3' and r, we can set kn according to (4.4), 
with /3 = 2, and then set Un x log(n)“^. The requirement on Un can be 
further relaxed when X is a Levy process as evident from (6.5). Finally, 
we can draw a parallel between our finding for faster rate of convergence of 
the estimator of /3 when /3 = 2 with the result in [9, 10] for faster rate of 
convergence for the maximum likelihood estimator of the stability index of 
i.i.d. /3-stable random variables when /3 = 2. 








14 


V. TODOROV 


7. Monte Carlo. We test the performance of the proposed method for 
jump activity estimation on simulated data from the following model 

(7.1) dXt = (Tt-dLt, dat =—O.OSatdt + dZt, 

where L and Z are two Levy processes independent of each other with Levy 
densities given by ul{x) = and vz{x) = 

— Zx 

0.0293 ^T 3 -l{a;>o}! respectively, u is a Levy-driven Ornstein-Uhlenbeck pro¬ 
cess with a tempered stable driving Levy subordinator. The parameters gov¬ 
erning the dynamics of a imply E((Tt) = 1 and half-life of shock in a of around 
one month (when unit of time is a day). L is a mixture of tempered stable 
processes with the parameter /3 coinciding with the jump activity index of 
X. We hx A = 0.25, and consider four cases for /3. In each of the cases we set 
Aq and Ai so that /r \x\^~^ dx = l and Ai dx = 0.2. 

The four cases are: (1) (3 = 1.05 and Aq = 0.1299, Ai = 0.0113; (2) jd = 1.25 
and ^0 = 0.1443, = 0.0125; (3) /3 = 1.50 and Aq = 0.1410, = 0.0141 

and (4) /3 = 1.75 and = 0.0975, = 0.0158. 

In the Monte Carlo we set T = 10 and n = 100 which corresponds approxi¬ 
mately to two weeks of 5-minute return data in a typical financial setting. We 
further set kn = 50 and p = 0.51. The initial estimator to construct the mo¬ 
ments and the optimal weight matrix is simply f3f^{p,u,v) with u = 0.1 and 
V = 1.1. If p > u, u)/2, then we reduce the power to p = /3-^^(p, u, u)/4. 

Based on the initial beta estimator, we estimate the values of u for which 
C{p,u,/3) = 0.95 and C{p,u,l3) = 0.25, and then split this interval in five 
equidistant regions which are used in constructing the moment vector in 
(5.4). 

Regarding the number of moment conditions, K, in the construction of 
our estimator, we should keep in mind the following. Larger K helps im¬ 
prove efficiency of the estimator as our equal weighting of the characteristic 
function within each moment condition is suboptimal. However, the feasible 
estimate of the optimal weight matrix is unstable in small samples when K 
is large. (This is similar to “curse of dimensionality” problems occurring in 
related contexts; see, e.g., [11] and [19].) Moreover, since the characteristic 
function is smooth, one typically does not need many moment conditions 
to gain efficiency. For example, we also experimented in the Monte Carlo 
with ten moment conditions (by splitting the region of u into ten equidis¬ 
tant regions). The performance of the estimator based on the ten moment 
conditions was very similar to the one based on the five moment conditions 
whose performance we summarize below. 

The results from the Monte Car^o are reported in Table 1. For com¬ 
parison, we also report results for j5{p) where p is set to the level which 
minimizes the corresponding asymptotic standard deviation in Figure 1. We 
notice satisfactory finite sample performance of /3(p, u). In all cases for /3, 
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Table 1 

Monte Carlo results 


Case 


3(p, u) 



Pip) 


Median 

IQR 

MAD 

Median 

IQR 

MAD 

/3 = 1.05 

1.0801 

0.0791 

0.0518 

1.1154 

0.0925 

0.0792 

d = 1.25 

1.3058 

0.0817 

0.0680 

1.3229 

0.1158 

0.0932 

d = 1.50 

1.5398 

0.0886 

0.0622 

1.5767 

0.1405 

0.1072 

d = 1.75 

1.7782 

0.0806 

0.0536 

1.8196 

0.1704 

0.1183 


Note'. IQR is the inter-quartile range, and MAD is the mean absolute deviation around 
the true value. The power p for j3{jp) is set to the value which minimizes the corresponding 
asymptotic standard deviation displayed in Figure 1. 


/3(p, u) contains relatively small upward biases. These biases, however, are 
well below those of /3(p). We note that the finite sample bias of /3(p, u) can 
be signihcantly reduced if, similar to /3{p), one uses an adaptive choice of 
power in the range (/3/4,/3/3). The superiority of /3(p, u) holds also in terms 
of precision in estimating /3, with inter-quantile ranges of /3(p, u) typically 
well below those of /3(p). 

8. Empirical application. We now apply the developed inference proce¬ 
dures on high-frequency data for the VIX index. The VIX index is a option- 
based measure for volatility in the market (S&P 500 index). It serves as a 
popular indicator for investors’ uncertainty, and it is used as the underlying 
asset for many volatility-based derivative contracts traded in the financial 
exchanges. Earlier work, consistent with parametric models for volatility, 
has provided evidence that the VIX index is a pure-jump ltd semimartin¬ 
gale. Here, we estimate its jump activity index. The estimation is based on 
5-minute sampled data during the trading hours for the year 2010. Like in 
the Monte Carlo, we split the year into intervals of 10 days (two weeks) and 
estimate the jump activity over each of them. The moments, the power p 
and the block size kn, are selected in the same way as in the Monte Carlo. 
Estimation results are presented in Eigure 2. The estimated jump activ¬ 
ity index takes values around 1.6. Overall, our results support a pure-jump 
specification of the VIX index. 

9. Proofs. In the proofs we use the shorthand notation !£”(•) = E(-|J-)a„) 
and Pf(-) =P(-|TiA„)- We also denote with K a positive constant that does 
not depend on n and u and might change from line to line in the inequalities 
that follow. When we want to highlight that the constant depends only on 
some parameters a and b, we write Ka^b- 
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VIX Index, Year 2010 



10 15 

Time Period 


Fig. 2. Jump Activity for the VIX Index. Estimation is done over periods of 10 days in 
the year 2010. In the estimation, moments p and are selected as in the Monte Carlo. 


9.1. Decompositions and additional notation. In what follows it is con¬ 
venient to extend appropriately the probability space and then decompose 
the driving Levy process L as follows: 

(9.1) Lt + St = St-\- St, 

where S, S and S are pure-jump Levy processes with the first two character¬ 
istics zero [with respect to the truncation function «:(•)] and Levy densities 
, 2|z/'(3:)|l{,^/(a;)<o} and |z^'(x)|, respectively. We denote the associated 
counting jump measures with ft, pti and /i 2 - (Note that there can be depen¬ 
dence between ft, fii and 

S is /3-stable process, and S and S are “residual” components whose effect 
on our statistic, as will be shown, is negligible (under suitable conditions). 
The proof of the decomposition in (9.1) as well as the explicit construction 
oi S, S and S can be found in Section 1 of the supplementary Appendix of 
[24]. 

We now introduce some additional notation that will be used throughout 
the proofs. We denote for i = -|- 3,..., n. 


i-2 


v^ip) = 


k, 


■ E ka-2)A.-nA,"s-Aj_.sr. 

' j=i-kn-l 
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1 ^ \A^S-A^_^S\P 




kr) 

j = l-kn-l 

mp)= Y 

j = i-kn-l 




[(* “ i “ 4) V 0 -I- l{j<j_3}] 


kr, 


+ 






X \A^S-AP^S\P, 


i-2 


lo'l? = 


T Y l^0-2)A„ 

We further denote the function 


= exp( - 

and direct computation yields 


C'p,/3«^k(i-2)A„-l^ 


rfi/P 


fiA^) = -kuA) --: 


p 


P \P 


We note 


(9.2) 


sup \ku{x) + kuA) + fiAx) + fi,uik\ < k 


a:GM+ 


f i,u\ 


where the positive constant depends only on u and is finite as soon as u 
is bounded away from zero. 

With this notation, we make the following decomposition for any u € M+: 

4 


ik{p,u) - C{p,u,l3) = 


1 


n — kr, —2 


znu) + z;(u) + Y,RjM 

i=i 


where Z^u) = ELfc^-^s 4 A) for j = 1,2 with 


zj (u) = cos ( u 


air-2)Ar,-A:S-/^_,S) 

{v-{p)y/p 

C'„./su^|(Tr,_9'iA„-l^' 


exp 




A;'(|!f|J7”(p))W»l 


exp 


AA{vr{p))kp )' 

C'p,/3U^|U(i_2)A„-l^ 
{\a\y/P 
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and R^{u) = EILfc^+s (^) J = 1’ 2,3,4 with 

f A?X-A. 

Tj [U) = COS I 


Tj (u) = exp - 


A/3n^|a(i_2)A„-l^ 


,1/P J ivnp)V/p )’ 


rf(u)=exp{ - 


An\V,^{p)y/P 
A0U^\(T^i_2)Ar,-f 


-exp( - 


^/3ll^|cr(i_2)A,-l^ 


V An\V^^{p)YIP /’ 

C'p,/3n^|o-(i_2)A„-l^ 


l^nHmp)Y/P J AnH\a\\V^,{p)yiP)' 

if \ ( ^P,/3ll^l'^(*-2)A„-l^^ I ^ 

A W = exp|^-- J - exp{-Cp^puP). 

We finally introduce the following; ^i(n) = (i*)) Z^’"'\u) = 

Er=fc„+ 3 ^i“’^^(ii) and = where 

= cos(nA;;;^/^/rp y^(AfS'- A"_iS')) - £(p,n,/3), 

(u) = G{p, u, /3) {A-P/^V^ (p) - 1), 

- if. 


9.2. Localization. We prove results under the following strengthened ver¬ 
sion of Assumption B: 


Assumption SB. We have Assumption B and in addition: 

(a) the processes \at\ and are uniformly bounded; 

(b) the processes 6" and are uniformly bounded; 

(c) |5"(t,x)| -|- |5'^(t,x)| < 7 (x) for all t, where ^{x) is a deterministic 
bounded function on M with Jg \'y{x)\'^~^‘' X{dx) < oo for arbitrarily small l> 0 
and some 0<r</3] 

(d) the coefficients in the Ito semimartingale representation of 6" and 6°^ 
satisfy the analogues of conditions (b) and (c) above; 

(e) the process A l)^^(dx) is bounded, and the jumps of S, S 

and Y are bounded. 


Extending the results to the case of the more general Assumption B fol¬ 
lows by standard localization arguments given in Section 4.4.1 of [12]. 

9.3. Preliminary results. The strategy of the proofs is to bound the terms 
Rj{u) for j = 1,2,3,4 as well as Z^{u) — Z'l{u) and Z^iu) — zf’^\u) — 
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{u) , and to derive the asymptotic limits of (u ), (u) and {u). 

We do this in a sequence of lemmas starting with one containing some pre¬ 
liminary bounds needed for the subsequent lemmas. 

Lemma 1. Under Assumptions A and SB and kn for w € (0,1), 
we have for 0 <p < /3, r>0 arbitrarily small and I <x < ^ and y>l, 


(9.3) A-P/%|y-(p)-i>-(p)|<iCa„, 


— 


. (2-l//3)(l-r(p-l/2)AO-0 




(9.4) 


\/ 

m-^/^Vr{p) - + E| A-P/^Fr(p) - 1| 


<K 


k 


-xjl 


if P/p> 2, 


n 5 
1—x 


(9.5) - k(,_2)A„-r)l < Kknl^n, 

(9.6) - k(,_2)A„-n" < 

(9.7) A-PF|Er_,_3(F-(p) - - vr{p))\ < KknAr,, 

(9.8) A-"F/3e|F"(p) - pl^^^\a\yr:{p) - VripT < KiknAnT^M-.^ 

(9.9) A-^F/3E|F”(p)r < 

Proof. We start with (9.3). We apply exactly the same decomposition 
and bounds as for the term A^ in Section 5.2.3 in [22] to get the result in 
(9.3). We continue with (9.4). Without loss of generality we assume kn > 2, 
and we denote the two sets 

kn 1 


Jf = < i — kn — 1 + 2k: k = 0,..., 


J° = < i — kn — 1 + 2k + 1: k = 0,. 


kn - 2 


With this notation, we can decompose Vj^{p) into 


F 


(e,n) 


1 


(p) = rT.\^u-2)An-m]s-A-_,sf 


i&J! 


yt’"\p) = K{p)-v^"'"\p)- 
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We further denote \a\li = ^J2jeJfW{j-2)A„-\^ and |cj 
l'^(j- 2 )A„-l^- Using the triangular inequality, we then have 


- pI'S 


— |P| 

'p,fi I 




'P,0 ' 


Now, since E”_ 2 |A"S' — A'j_^S\p = the sums An^^^v}^’^\p) — 

/^p ^/3 l^lei An^^^V^°'^\p) — discrete martingales. Prom 

here, the result in (9.4) for the case /3/p < 2 follows by a direct application 
of the Burkholder-Davis-Gundy inequality and the algebraic inequality 


(9.10) 




Vp G (0,1] and any real-valued {ai}i>i. 


We are left with the case filp>2. We only show the bound involving the 
term V^^’^\p), with the result for V^°’^\p) being shown analogously. We 
first denote An^^^u/''’"'^(p) - SjeJf C” where Cj = An^'^^ x 

|(T(j_ 2 )A„-l^(|A"'S' —A”_j^S'|P —Pp^^). Applying the Burkholder-Davis-Gundy 
inequality, we have 



If X <2, the result in (9.4) then follows by Jensen’s inequality. If x > 2, 
applying again Burkholder-Davis-Gundy, we have 



where we also made use of the fact that the /3-stable random variable has 
finite pth absolute moment as soon as p G (0, /3). If x < 4, the result will then 
follow from an application of (9.10). If x > 4, then we repeat (9.II) with x 
replaced by x/2 and replaced {CjY “1E”_2(C”)^. We continue in this way, 
applying k = sup{/:2® < x} times (9.II) and then (9.10). This shows (9.4). 
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We continue with (9.5) and (9.6). We make use of the following algebraic 
inequality: 

I |a -|- — \a\^ — psign{a}|a|^“^5| < Kp\a\^~‘^\b\‘^, 

for any a,b £ R with a ^ 0, 0<p<l and Kp that depends only on p. 
Applying this inequality as well as the triangular inequality, and using the 
fact that under Assumption SB the process |cr| is bounded from below, we 
have 


(9.12) 

(9.13) 


lEsdo’tT - ksDI < A'lt - s|, 0<s<t, 

Es\\at\P - < KEs{\at - a,!" V \at - ad""), 

0<s<t,q>l, 


with some constant K that does not depend on s and t. From here (9.5) 
follows. Application of Corollary 2.1.9 of [12] further gives 

(9.14) Es\(Jt — as\‘^ < K\t — 0<s<t,q>l, 


and applying this inequality with q = y and q = 2y, for y the constant in 
(9.6), we have that result. 

We proceed by showing the bounds in (9.7)-(9.9). We can decompose 


|a|? 


- W{k-2)Ar,-\^ = for k = i- kn-l,...,i-2 and 


1 


^3 

i-2 


l = r'E (k(J-2)A.-l'’-|«A.-|'’), 


0-k — 

j=k+?. 

^2 (*-A:-4)V0, 

Qfc 7 ( 

.3 


|afcA„-r - Wik-2)An-n, 


Ofc — ((kfcA„-r - k(fc-2)A„-r)l{fc<i-3} 

+ (k(fc-l)A„-r “ W{k-2)A„-\^)^{k<i-2})/kn, 

= r E {k(7-2,A„-l’’-k,t_2,A„-l'’), 

j = i-kn-l 

with a\ being zero for k>i — A. Using the law of iterated expectations and 
the bound in (9.6), we have for /c = i — — 1,..., i — 2, 

(9.15) A-"^>/%(|4 + al\\^lS- l2.l_,SrT < 2A(fe„A„)"Mi-h 

Using the Holder inequality, the bound in (9.12), as well as the fact that a 
stable random variable has finite absolute moments for powers less than /3, 
we have for A: = i — A:„ — 1,..., i — 2, 

A-"P/%(|ai + a|||A^5-A^_i5|"')" 




(9.16) 
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Combining (9.15) and (9.16), we get the results in (9.8) and (9.9). 
Further, using (9.12), we get for /c = i — 1,..., i — 2, 

(9.17) A.T-’/f |E”_t„_3((ai + afmS - AJ_,Sr)| < J7(i,A„. 

From here we get the result in (9.7). □ 


Lemma 2. Under Assumptions A and SB and kn ^ for w G (0,1), 
we have for 0 <p < f3, t>0 arbitrarily small and every 0 < a < b < oo, 

(9.18)-^--e( sup |i??(u)|) 

71 kn. 2 /il 2 


^uG[a,b] 

(9-19)---e( sup \R 2 {u)\) <Ka,ban, 

n-kn-2 \ue[a,b] ^ 

-E(^ sup |i? 3 (u)|) 


n — kn —2 


(9.20) 


(9.21>- 




< 

1 


Ka,b{{knAn)^-^ V k-^/\knAn)^AA{d-p)/^ 

KaMknAn)^AAl-.y 

-E( sup \R2{u)\) <Ka,b{knAn)^~‘', 


if l3/p> 2, 
if l3/p< 2, 


n — kn —2 

where Kafi depends only on a, and b and is finite-valued. 

Proof. We start with showing (9.18). We define the set 

Ci = {|A"^^^P”(p) - i = kn + 3,...,n, 

and then we note that 

i{c-> < i(a;7"’|v;» - vr{p)\ > 

+ i{\A-’’/i>vr{p) - pifw > y/jK). 

Hence we can apply (9.3) and (9.4) and conclude 

(9.22) e[ sup (|Ln^^)|l{cn)l ^ V A)-(/^/(2p)A(/3-p)/p)+^). 
l«eK+ “ 

We proceed with a sequence of inequalities. First, from Assumption SB, 


(9.23) 


E: 


1-2 


Next, if /3' < 1, we can decompose 

rt 


{oiu CHu—An^du 




(9.24) 


St = 


10 


xpLi{ds,dx) -t / K(x) 2 |i/'(x)|l{j,/( 3 .)<o}dx, 
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and separate accordingly ctu- dSu and ^u- dSu- For the dif¬ 

ference of the integrals against time, we can proceed exactly as in (9.23). 
Further, using the algebraic inequality in (9.10), as well as Assumption A 
for the measure v', we have 

forx</3'. 


(9.25) Er_i 


r 

JO-DAr,. JK 


au-Xfii{du, dx) 


When P' >1, we can apply the Burkholder-Davis-Gundy inequality and get 

• 'i Z\ 7] 

(9.26) / a^i-xdS^. 


r 

J (i- 




forx</3'. 


The same inequalities hold for the analogous integrals involving S. Next, 
application of the Burkholder-Davis-Gundy and Holder inequalities, as well 
as Assumption SB yields 


(9.27) Er_2 






(au- -a(i_2)^„-)Hi{x)^i{du,dx) 




Finally, denoting k'{ x) = x — k(x) and upon noting that k'(x) is zero for x 
sufficiently close to zero, we have 


(9.28) 


CiAr] 


'(2-l)Ar, 


(au- - <T{i- 2 )An-)''^'ix)Kdu,dx) 


<KAr. 


Vi>0. 


Gombining the estimates in (9.23)-(9.28), as well as the inequality | cos(x) — 
cos(y)| < 2\x — y\P for every £ M and p € (0,1], we have 


(9.29) E[sup(|ri(u)|l{(c^.)c})j < y 


^u>a 


Equations (9.22) and (9.29) yield (9.18). We continue next with (9.19). This 
bound follows from a first-order Taylor expansion of fi^uix) and the bounds 
in (9.2) and (9.3). 

We proceed by showing the result for R2(u). Using a second-order Taylor 
expansion and the Gauchy-Schwarz inequality, as well as (9.6), we get 


(9.30) 


E 


sup 

Me[a,b] 


R. 


2(u) - ^e-^' 
P 


v,P^^C. 


P,pu- 


' E i 

i = kn+^ 


< Kk„ 


where 
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Using (9.5), we have 


(9.31) 


E 




i — kn'\~2> 


< Kkn- 


Further, without loss of generality (because kn^n —t 0), we assume n > 
2kn + 3. Using the shorthand Xi=^i ~ ^ then decompose 


n 

i=kn+3 


kn+l n 


j — 1 i —2/^72 +4+ ( [(n— fcn — 2)/ (/CnH-l)J — l)(fcn +1) 


\{n-kn-2) / {kn+l)\ 

= ZZ Xfc„+3+(i-l)+(i-l)(fcn+l)) i = 1, ■ ■ •, A:n + 1- 

i=l 

Applying the Burkholder-Davis-Gundy inequality for discrete martingales 
and making use of (9.6), we have 


(9.32) E\Aj\< K{knAn)-\ j = 1, . . . ,kn + 1. 


Combining (9.30) and (9.32), we get the bound in (9.21). 

We are left with (9.20). The case /3/p < 2 follows from 

E|r3(u)| < Ka,b\A-P/f^Vr{p) - 

and by applying the bounds in (9.8)-(9.9). We now show (9.20) for the case 
/3/p > 2. We first decompose rf{u) = qHu), where 


X A-o/AiAfvl'ip) - \p\WAp) - p;fA"{p)), 
(g{u)=nA3)AA'A;fvr{p), 
elA) = ifUA - /U(A;''"'F”{p)|ffir)) 

X AA'Ap^fvAp) - Hvl(p) - p;T'^i''(p)) 

and ir is a random number between AA^^pX^VP(p) and An*’^^|ir|?V"(p). 
We further introduce 


(“) = ^„ AA''’(p;fv;'{p) - |<f|'y‘(p) - pAJ^v’-Ip)) 


\^{i-k„-3)An- \ 


'pA 


and note G{p,u,l3) = \(7(^i_2)Ar,-\^fi,uiW{i-2)Ar,-\^)- Then direct calculation 
for the function xf[^{x) and the boundedness of the process |(t| yields 


Qi{u) - q]{u)\ < Ka,b{dZ + dZ)eu 





JUMP ACTIVITY ESTIMATION VIA SELF-NORMALIZED STATISTICS 25 


where 


= |A 


-p/p- 




= Ml - k(i-2)A„-|P| + lk(i-2)A„-|P - 


'^(i— fcrt—3)A„ —I I) 

-vIPir 


a = A;'"*'if.,7T”(p) - iairE‘(p) - A.;,7v'”(p)i 


From here, we use the Holder inequality and (9.4), (9.6) and (9.8) to get 




(9.33) < 


< KM/\knAnM^^^d-p)IP-2.^ 


^ E\dPei\ < < K{knAM-^. 

For the sum using the bounds in (9.7) and (9.8), we can 

proceed exactly as for the analysis of X]iLfe„-i- 3 Ai above and split it into 
kn + ^ terms, which are the terminal values of discrete martingales. Together, 
this yields 


(9.34) 


E sup 

\ ue[a, 6 ] 


X] sliu] 


i — kn~\~S 


— Ka^b^niknAn) 


Next, using the bound in (9.9) as well as the boundedness of the derivative 
f'iuM ^ ^ [uj^])) we have 

(9.35) e( sup |^?2(n)|) 

'ue[a,b] ' 

We continue with the term gf{u). We hrst introduce the set 

£i’ = {\ (p) - I u| (p) - (p) I > 1}, i = kn + 3,...,n. 

With this notation, using (9.8) and the boundedness of the derivative /[^{x) 
(for rt E [a,b]), we have 

(9.36) e( sup lgf(u)llf£nj) <Ka,bMAny~''. 

^uela,b] " ^ 

Next using the boundedness of the second derivative //(^(x), as well as the 
bounds in (9.8) and (9.9), we get 



Combining (9.33)-(9.37), we get the result in (9.20). □ 
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Lemma 3. Under Assumptions A and SB and kn for w € (0,1), 
we have for 0 <p < f3, t>0 arbitrarily small and every 0 < a < b < oo, 


1 


n — kr, —2 


(9.38) 


sup \Z^{u) -'Z^{u)\ 

u£[a,b] 


= Op{an V A;-(/^/(2p)A(/3-p)/p)+^ y y^), 

and further if p < /3/2, 


(9.39) 


n kn 2 n£[a. 


sup \Zlf{u)-Z^^’"\u)-Z^^'"\u) 


= Op(A:-(^/2)(/3/pA3)+. ^ 


Proof. We start with (9.38). We split Z'f{u) — Z^{u) = E^{u) + E^iu) 
with Ef{u) = Er=fc„+ 3 (^*n«) -^,nw))l{Cf} and Elf{u) = Er=fc„+ 3 (^i (^) “ 
zl(u))l{(cn^c}. For ^^^(u), using Lemma 1, we easily have 

(9.40) -^--e( sup \E^{u)\)<Ka,b{an\/k-Wi^P'>^ih-p)/p)+^), 

n-kn-2 \ueia,b] > 

We proceed with Elf^u). We first note that 

(9.41) Ef_2[(4(«) -EH«))l{(cr)-}] = 0. 

Further, using the algebraic inequalities |cos(x) — cos(y)p < 2\x — y\ for 
a;,y e M and \e~^ — < 2\x — y\ for x,y € M+, as well as the definition of 

the set C”, we get 


e; 


i-2 




Applying the above two inequalities, the bounds in (9.3), (9.4) and (9.6), as 
well as the algebraic inequality 2xy <x‘^ + y‘^ for x, y G M, we have 

E{E^{u)f = E( ^2 {zliu)-zj{u))\^cm] 

\i=kn+3 / 


+ ^i ^ {4iu) - 4(«))l{(Cfp}(2l(^^) - ^](^^))l{(C7p} 


<K, 




i —fcnH“3 


< Ka,bA-\an V k-(f^/(^p)Mh-P)/p)+^ V (fe„A„) 
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1 P 

As a result, , E^ju) —> 0 finite-dimensionally in u. Finally, we 

need to show that the convergence holds uniformly in u E [a, b]. For this we 
apply a criteria for tightness on the space of continuous functions equipped 
with the uniform topology; see, for example. Theorem 12.3 of [7]. Using 
again (9.41), we have 

nEUu)-EUv)f 

<Ke( ^ {zl{u)-zjiu)-zl{v) + zj{v)fl{^c^)c\ 

\i=kn+3 J 


Hence for arbitrarily small i > 0, 
1 


n — kr, —2 


IE izj{u) - zj{u} - zj{v) + z]{v)fl{^clY^^ 




< K{\u^ — M\u — v\^ 


1 p 

and since /3 > 1, we have ^n-k -2 ^^P«g[a,&] 1-^2 (^)l —^ turn next 

to (9.39). We first introduce some additional notation. Based on a second- 
order Taylor expansion of the function fi^u{x), we can further decompose 

%(u) = + zl’”>(u), with zl‘-”l(») = 

for — a, 6, c, where (u) — z'f(u) — — zf'^^ (u) and 

- 1). 

- 1)". 


Note further that 

f k(i-2)A„-l^//,n(k(i-2)A„-l^) = G(P,U,/3), 

\ W{i-2)Azz-\‘^^fi,uiWii-2)A„-\^)=E{p,U,P). 

Direct calculation, and using the boundedness of the process a by Assump¬ 
tion SB, shows 

\HfUK) - G{p,u,P)\ + \iHffUK) - H{p,u,P)\ 

<Ka,b\\a\^- \cr(i-2)An-\^\^ [a,b],i = kn + 3,...,n, 

for some finite-valued constant Ka^ which depends only a and b. From here, 
using the bounds in (9.4) and (9.6), we have 


E 


sup I 

u£[a,b] 


M,2) 


{u)-zY\u)\) < 
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and similarly 

'^uG[a,6] '' 


Therefore, 


1 


sup 1 4"’"^ (u) - (u) + (u) - {u) 

wG[a,6] 




n — kn —2 


(9.42) 


= Op(A:-^/(2p)+. V 


We are left with Z 2 ^’^\u). Using the boundedness of the derivatives in (9.2), 
we have 

|zf’^^(u)l <Ka,b\^n^^^V^{p)-M'', 2<x<l3/pA3. 

From here, applying (9.19), we have 


(9.43) 


1 


n kn 2 u&[a,h] 


sup |Z^"’”^(u)| 


Combining the resnlts in (9.42) and (9.43), we get (9.39). □ 
Lemma 4. Let p G (0,/3/2). If kn x n'^ for w G (0,1), we have 


(9.44) 


1 I , 

Vn-k^-2 I 4“’")(u) i 


where C(u) is a Gaussian process with covariance function given by 


(9.45) 


1 

G{p,u,(3) 


E{p,u,v,l3) 


1 


G{p,v,/3)J ’ 


u, n G 1R+, 


for E{p,u,v, P) =Eq{p,u,v, f3) + 2Ei{p,u,v, /3). The convergence in (9-44) is 
in the space of continuous functions K+ —)• equipped with the local uniform 
topology. The convergence result for Zi{vl) in (9-44) continues to hold for 
pG [13/2,13). 

Further, for some i> 0, 

kn -T^ib.n) , s 

Z 2 (u) 


n — kn —2 
1 


- -^H{p,u,l3){E^Q’‘^\p,u,u,l3) + 2Ef^‘^\p,u,u,l3)) 


(9.46) 

locally uniformly in M_|_. 
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Proof. We can write 


(9.47) 

where 


■ Z'!(u) ' 


n—kn — 1 

^ Ci(u) + Ei(u) + Er(u), 

i=kn+l 


Ciiu) = 


' cos(nA-'/^/r;/(Ar5 - A^.S)) - C{p, u, (3 )' 


Gip,u,l3)[A-^/f^p;f\A^S - - 1 ] 


Ei{u) = 'f2 


i —1 


i=2 

n—2 


0 

*•( 2 ), 


Eriu) = 


^ kn vcr^w 

n — 1 — i 


^^1+2 / >(!)/ \ 
, ^ ( Q ’{u) 

i=kn + l 


i=n—kn 

We note that for u G M+, 


kn 


0 

<?3u) 


E 

i=n—kn 




(9.48) Er_2(C(^^))=0, i = 2,...,n. 

Further, making using of the ineqnality | cos(x) — cos(y)| < 2\x — y\^ for every 
p G (0,1] and x, y G M, we have for rt, u G M+, 

(9.49) E^_2{Ci^\u) — < K\u — v\^ W\u^ — v^f, l<p<f3. 

(2) 

Making use of (9.48) and the fact that Q (u) depends on u only through 
H{p,u,(3) and sup„g]R+ \H{p,u, I3)\ is a finite constant, we have 

(9.50) ^E(sup|£;K^)ld 

Making use of (9.49) and the differentiability of G{p,u,l3) in u, we also have 

^E{Er{u) - Er{v)f < \E{u) - E{v)\P, 

for some increasing function E{-) and some p > 1. Applying then a criteria 
for tightness on the space of continuous functions equipped with the uniform 
topology (see, e.g.. Theorem 12.3 in [7]) as well as making use of the fact 
that knAn 0, we have locally uniformly in u, 

(9.51) \ Er{u)^0. 

y/n — kn — 2 

We are left with the first term on the right-hand side of (9.47). First, we 
establish convergence for this term finite-dimensionally in u. We have the 
decomposition 

n—kn — 1 n—kn — 1 n—kn—2 

Y. C.W= E K.(a)-E?-i(Ci(«)))+ E E?K<+i(«)). 

i = kn + l i=kn + l i=kn 
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From here, we can apply a c.l.t. for triangular arrays (see, e.g.. Theo¬ 
rem 2.2.13 of [12]) to establish that Yl'iZkn+i converges finite- 

dimensionally in u to C(u). This convergence holds also locally uniformly in 
u using the bound in (9.49) and Theorem VI.4.1 in [14]. Combining the lat¬ 
ter with the asymptotic negligibility results in (9.50) and (9.51), together 
with the fact that kn/n —)• 0, we have the result in (9.44). Furthermore, since 
Zi{w) depends on p only through the marginal convergence in (9.44) 
involving [^(u) holds for any p G (0,/3). 

We turn next to (9.46). We denote 

Xi = {p) -if - (p, u, u, 13) + 2sf(p, u, u, /?)), 

/o o\ fO O') 

and we note that Hg ’ {p,u,u,/3) and H) ’ {p,u,u,/3) do not depend on u. 

Without loss of generality we can assume n > 2kn + 3, and then we set 


^3= X] Xfc„+3+(i-l)+(i-l)(fcn-Hl)> j = 1, • • • + 1- 

i=l 

Since Ejyj] < K, 


(9.52) 


n kn+l 

^ Xi-Yl 

i=kn+3 j=l 


Op(kn)- 


Further, direct computation shows 


E”_fc^_ 3 (Xi) =0, i = A:n + 3,...,n, 

and applying the Burkholder-Davis-Gundy inequality for discrete martin¬ 
gales, we have 


(9.53) 


EjA.r < l<x<f. 

Zp 


Using inequality in means we further have 




kn + 


i=i 




1 '•'/t I 1 ‘''TL I 

<-f—y\A 

+ l ^ ^ - k„ + l ^ ■ 


X 


1 < X < 


i=i 


£ 

2p 


Applying the above inequality with x sufficiently close to j3/{2p) and the 
bound in (9.53), we have Aj -*^0, and to¬ 

gether with the result in (9.52), this implies (9.46). □ 
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9.4. Proofs of Theorems 1 and 2. Theorem 1 and (4.5) of Theorem 2 

follow readily by combining Lemmas 1-4 [and nsing (9.4) for bounding Z^iu) 
in the proof of Theorem 1]. To show (4.7), we note first that H{p,u,fi) and 
Ei{p,u,u, P), for i = 0,l, are continuously differentiable in /3. For H{p,u,/3) 
this is directly verifiable, and for Hj(p, u, u, /3) with z = 0,1, this follows from 
the continuous differentiability of the characteristic function /3 —)■ for 

u € R+. Moreover, the derivative (3) is bounded in u. Prom here, 

(4.7) follows from an application of the continuous mapping theorem. 

9.5. Proof of Theorem 3. We denote the true value of the parameter /3 
with Pq. Then the claim in (5.7) will follow if we can show the following: 

(9.54) m(p,u,/3-^^,u,/3)m(p,u,;0) uniformly in/3 G [1,2], 
where m(p, u,/3) is defined via 

in{p,u,P)i = / (log(£(p,u,/3o))-log(£(p,u,/3)))du, 

Ju\ 

(9.55) Vraffl(p,u,^■^^u,/3o) W^/^(p,u,/3o) x N, 

where N is 77 x 1 standard normal vector and 

(9.56) M(p, u,/3) —)• M(p, u,/3) uniformly in a neighborhood of Pq. 

This is because m(p, u, /3) = 0 if and only if /3 = /3o and W{p, u, Pq) is positive 
dehnite. 

We start with (9.54). We have 

[ log{C{p,u,P))du^ [ log{C{p,u,P))du 

Ju^l Ju'^ 

uniformly in /3 G [1,2] for i = 1,..., 77 because of u; —> u; and Uh —> Uh 
as well as the continuity of the function in P for every u G M_|_, and 
the argument can be used to show (9.56). To show (9.54) it remains to show 

log(£"'(p, u, P^^Y) du log(£(p, u, Po)) (7u for i = 1,..., 77. Due the 

continuous differentiability of the de-biasing term in /S, ^ /3o and the 

asymptotic boundedness of u; and and of C'^{p,u, P^^f from below, we 

have /^i'*[log(£”(p,u,/3'^^)') — log(£"'(p, u,/3o))] du0. From here (9.54) 
follows by applying Theorem 1. 

We are left with (9.55). This result follows from applying the uniform 
convergence of C'^{p,u, P-^^f in Theorem 2. 

Finally, (5.8) follows from the continuity of G{p,u,P) and W“^(p, u,/3) 
in u and p. 
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9.6. Proof of Theorem f. We will use the shorthand notation Vn = pUn- 
We start with the following lemma. 

Lemma 5. Under the conditions of Theorem f we have 

(9.57) - C(jp,Un,P'') = 

(9.58) 


where 


y2 — 2;2 


” Z„,^,r^Zi--Cp,2Z2, 


24a 


p,2 P 


Zn = T;-5-(£"(p,Un,^^*)' - C{p,Un,d^'')) 




Cp,2vl 


i£^{p,VnJh’-J^iP,Vn,Pn). 


Proof. We use the same decomposition of C'^{p,u,/3) — C{p,u,/3) as in 
the proofs of Theorems 1 and 2. We start with the leading terms Zi{un), 
Z^’"'\un) and zf’'^\un)- Using Taylor’s series expansion, we have for any 
u G M+ and Z G M, 


cos(uZ) — 1 = — 


u^Z^ u^Z^ 


+ 


24 


+ R(uZ), \R{uZ)\< K\uZf 


l — e “* = — ^ + 0(u®) asu—)-0. 

Using this approximation we have (note that when Lt is a Brownian motion, 
then = 1 and so Cp^p = l/pp^p) 


(9.59) 


Cp^2Un 




Cp,2V^. 

l-vl ^ 


E 


24Up,2 . * 

2 — /Cti+O 


//2 
Up,2 


12 

Up,2 


+ Op{u^y/n). 


We similarly get 


1 -^(a,n) 


Cp,2U' 


2^2 

'n 


{An) 


1 -Fy{a,,n) 


Cp2vl 2 


(^n) 


n 


ivn-ul)-Cp,2 {Aj/^V^{p)-l) + Op{ulVn), 

i=kn-\S 


(9.60) 

















JUMP ACTIVITY ESTIMATION VIA SELF-NORMALIZED STATISTICS 33 


and also 


kr, 


n — kn — 2 




(9.61) 


i=kn+3 

- {E’'Q'^\p,Un,Un,2) + 2E^^'‘^\p,Un,Un,2)) 


— kriAn)- 

As in Lemma 4, it is easy to show 

n'^iAfS - 


12 \ 


(9.62) 


-Yn — kn —2 , 


E 

2 — /Cti + S 


^P ,2 


/^p ,2 


A-^/vr(p) -1 / 


^2 


Next, using Taylor’s expansion as well as (5^^ — 2 = Op{kn we have 


(9.63) 


ulkn 


{H{p,Un,Y^Y - H{p,Un,2)) =Op{l). 


We proceed with the rest of the terms in the decomposition of C^ip, Un, (3) — 
C{p,Un,/3) and C'^{p,Vn, Y) — C{p,Vnf3)- We start with the term Ri{un)- It 
relies on the bound in (9.3), which in turn depends on the analysis of the 
term A 3 in Section 5.2.3 of [22]. When L is a Brownian motion, the bounds 
for this term get slightly changed. In particular, the bound in equation (41) 
of that paper becomes now KA]^'' for g > r V 1 (this follows by using integra¬ 
tion by parts and the Burkholder-Davis-Gundy inequality) and arbitrarily 
small i > 0. Using this, it is easy to show that when L is a Brownian motion, 
the bound in (9.3) holds with an replaced by /3n, where 


Yn = 


. (3/2)(l+(p-l/2)A0-d 




Now the bound for R'^{un) becomes 

RUUn) 


(9.64) 


E 


nut: 


<K 


Pn V 


Further, using the same steps as in the proofs of Lemmas 1-3, as well as 


sup {\u\P\fln{x)\ + \u\‘^^\fi,u{x)\) < 00, 

u,xGR^ 


we get 
(9.65) E 



<Kj3nuJ, 

E 

Ri{Un) 

nui 



nui 


< K{knAr 
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(9.66) E 


R^{Un) 


nut 


< V fc-V2(fc„A„) 


(9 6^) E|Zr(..)-^(..)| <^ 


nut 


iPn V 


Ut 




E|%(n„) - Z^^'"\un) - Z^^'"\un)\ 


(9.68) 


nu; 


<K 


L-VP+'' 

f^n 


V fe-3/2+^ V A:-^/2(^^^^)l/rA(2-p)/2-. 


Combining the bounds in (9.64)-(9.68), together with (9.59)-(9.61), the 
result in (9.62) and (9.63), we establish Lemma 5. We further note that 
when X is a Levy process, R^{u) and R2{u) are identically zero. □ 


We proceed with the proof of Theorem 4. Using Taylor’s expansion and the 
result in (9.57), Z„, defined in the statement of Lemma 5, is asymptotically 
equivalent to 

r<^ 2 Un, P")') - Cp^2ul) 

Cp,2K 

- 7T^(-log(£”(p,n„,^^^)') - Cp^2Vn)- 
Cp,2Vi 

Using again Taylor’s series expansion, the result in (9.57) and that u~‘^y /—)• 
0, we have that the above is asymptotically equivalent to 

(log(- log(£’"(p, Un, ^‘^'*)')) - log(C'p, 2 Wn)) 

-(log(-log(£”(p,nn,^-^®)')) - log(C'p,2n^)). 

From here result (6.4) in Theorem 4, both in the general and Levy case, 
follows from Lemma 5. 
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